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System and method for audio signal processing 



This invention relates to the field of sound reproduction, and more particularly 
to the field of digital audio signal processing. 

The invention relates to a sound reproduction system comprising a digital 
audio signal input; a digital audio signal processor and a digital audio signal output 

The invention also relates to an audio signal processor for processing an 
incoming audio signal in an audio output signal. In particular the invention relates to a digital 
signal processor (DSP) circuit or program* 

The invention also relates to a method for processing a digital audio signal 



A sound reproduction system, such as e.g. a loudspeaker telephone system, 
includes an output transducer, often called a loudspeaker, and an input for an audio signal. 
The loudspeaker produces sound pressure waves in response to the audio input signal which 
is representative of a desired sound pressure wave. 

Intelligibility of the sound as perceived by the listener is very in^ortant, 
especially in noisy environments. The simplest way to increase the intelligibility is to 
increase the average SPL (Sound pressure level), i.e. turning up the volume. However, simply 
turning up the volume does not always lead to a more intelligible sound. Also, too high an 
output may lead to an overloading of a loudspeaker resulting in a further decrease of the 
intelligibility. 

A number of attempts have been made to increase the intelligibility of sound. 

United States Patent application US 2002/0015503 proposes e.g. to increase 
intelligibility by individually constraining the gain factors for different frequency bands. 

The existing systems and methods are, however, either very complicated, 
requiring complicated computations and thereby complicated circuitry (hard-ware), or, in the 
case of a program (soft-ware) being used, a complex program, or supply only a limited 
advantage. 
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Notwithstanding the above mentioned references, there continues to exist a 
need in the art for improved systems and methods providing improved intelligibility. 



It is an object of the present invention to provide a sound reproduction system 
and method with improved intelligibilily. 

To this end, the sound reproduction system in accordance with the invention is 
characterized in that the system comprises a digital signal processor, the digital signal 
processor comprising a high pass (HP) filter with a pass frequency of between 300 Hz and 2 
kHz, a compressing amplifier (22) for compression and amplification of a signal, at least 
amplification (gain) being performed after HP filtering, and a clipper for clipping the HP 
filtered, compressed and amplified signal above a clipping level. 

The system in accordance with the invention is based on the following 

insights: 

1 The incoming signal is amplified to increase the loudness, however 

2. increasing the incoming signal could lead to a signal higher than the maximum 
digital signal, in such circumstances the signal would often be digitally clipped, leading to a 
distortion of the signaL 

3. Low frequencies are removed from the signal, this allows the remainder of the 
signal to be amplified with, on average, a higher gain factor. This is done by the HP filter, 
situated before amplification. The lower frequencies are, in so far as mteffigibility is 
concerned, of relatively low importance. The lower frequencies comprise much of the 
amplitude of the signaL so removing the lower frequencies strongly reduces the amplitude of 
the signal, creating head room for amplification, i.e. a stronger amplification for the 
remainder of the signal. A large part of the amplitude of especially speech is comprised in the 
lower frequencies so attenuating lower frequencies allows for a considerable increase in head 
room (i.e. amplification without bitting a clipping level). 

4. Simply cutting the lower frequencies and increasing the output would increase 
loud sounds as much as the more subdued sounds. Although this leads to an improvement in 
inteUigibiUty, this does not always results in a readily perceivable increase in mtewgfoility, 
especially not when the loudness range, i.e. the compressed amplitude range is large. To 
further increase inteUigibuity the device comprises a compressing amplifier for amplification 
and compression of a signal. The amplitude range is then compressed (as well as amplified). 
The clipper clips the signal above a clipping leveL the clipping level is situated within the 
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compressed amplitude range, whereby the average loudness difference is reduced. The more 
subdued sounds undergo a stronger amplification than the louder sounds (leading to a 
compression of me signal to within a compressed amplitude range) which increases 
intelligibility, because the more subdued sound become better distinguishable. This is done 
5 by the compressing amplifier. Any desirable functional form relating signal amplitude in to 
signal amplitude out may be implemented as compression relationship, e.g. with a lookup 
table. The loudest sounds are clipped by setting the clipping level within the compressed 
anmlitude range, further increasing intelligibflity. This is done by the clipper. The signal is 
clipped at a certain amplitude (clipping level) within the compressed amplitude range of 
10 which the limits, in particular the upper limit, may be set at initialization time. The 
compression action may at least in part be effectuated before or after HP filtering, the 
amplification as such (i.e. the bulk of the gain) is performed after HP filtering since it is the 
removal of the lower frequency components by the HP filtering which allows for the on 
average higher gain of the remainder (i.e. me increase in headroom). 

The concept of clipper should not be understood as limited to a hard clipping 
(i.e. all values above a predetermined value C are set equal to C), but also comprise soft 
chpping, which in a subrange up to foe high limit of the compressed amplitude range, applies 
another predetermined signal amplitude in/ signal amplitude out relationship. 

Preferably the compressing amplifier is arranged not to amplify a signal 
20 having a signal strength below a threshold value. 

Below a threshold value (a certain minimum amplitude) the signal is probably 
due to noise. Not amplifying such signals improves intelligibiHty, since the noise is reduced. 
Furthermore the difference between silence and speech is better distinguishable, which also 
increases inteUigfoflity. The threshold may be set at initialization time. 

Preferably the digital sound processor comprises a low pass filter for filtering 
the signal provided by the compressing amplifier and for providing an output signal, the pass 
frequency of the low pass filter lying in the range 2 kHz-Fs/2, where Fs is the sample 
frequency. 

The compression action, but in particular the clipping action may introduce 
disturbing overtones, which reduce foe naturalness and inteUigibility of foe emitted sound. 
Furthermore due to foe use of foe high pass filter, foe signal already comprises a relatively 
high proportion of high frequency tones. The low pass filter cuts out or at least reduces foe 
overtones produced by the compressing amplifier leading to a more natural sound and 
increasing foe intelligibility. 



25 
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The high pass filter is preferably a first order or second order filter, i.e. a filter 
with a relatively gradual slope. It is advantageous to remove much of the energy of the low 
frequency components of the incoming signal to provide head room for the amplification. 
However, a filter with a slope that is relatively steep (a step filter being the most extreme 
5 example of such a filter) removes so much of the lower frequency components that this may 
result in a unnatural sounding voice. Preferably the device comprises a means for enabling a 
user to change the order and/or the cut/off frequency. Using a 2 nd order high pass filter results 
in good speech intelligibility and/or signal loudness, whereas using a 1 st order high pass filter 
will preserve the more natural sound of the original signal. 

10 Preferably the system, and/or program comprises a means for activation and/or 

setting of the frequency dependence of the low pass filter and/or the high pass filter in 
dependence on the average amplification in the levelling amplification stage. The average 
amplification is a measure for the average gain of the signal and thereby of the loudness level 
of the emitted sound signal. It is advantageous when the cut-off frequency of the high-pass 

15 filter increases with increasing average loudness level of the emitted signal, and the cut-off 
frequency of the low pass filter decreases with increasing loudness level. 

At very high amplification levels (as could happen when the device is used in 
loud, noisy environments) the levelling action is relatively large, this relatively large levelling 
action introduces a relatively large distortion of the signal, i.e. the unwanted, unnatural 

20 overtones of the signal comprise much energy. This leads to a harsh sound. The harshness of 
the sound sometimes and even often, as the inventors have realized leads to the listener 
keeping the loudspeaker, especially in mobile telephones, at some distance from the ear. 
Apart from the fact that holding the loudspeaker at some distance from the ear in itself will 
lead to a considerable reduction in signal to noise, since the signal will be reduced and the 

25 noise will be increased, the mere fact that the sound itself is perceived as harsh in fact means 
a reduction in intelligibility of the message given. For vocal messages the harshness of the 
voice often is a integral part of the message which the person speaking wishes to address the 
listener, sometimes even more important than the actual words of the messages. It is thus 
important for the intelligibility of the message, seen in a concept broader than merely 

30 whether or not words are understood, that a clear "natural" voice transfer is achieved. At 
lower amplification levels the "harsh sound" effect is much less audible. Selectively 
activating or setting the cut-off frequency at a relatively low frequency of the low pass filter 
at high amplification levels reduces the "harsh sound efifecf \ while at low amplifications 
levels the need for use of the high pass filter is less, and not activating the high pass filter 
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may actually lead to a more natural sound In short; at high amplification levels and strong 
levelling action a relatively large part of the amplitude at high frequencies is due to artificial 
artefacts due to the levelling action and removing (in toto or partially) the higher frequencies 
removes the artefacts thus leading to a more natural sound, at relatively low amplification 
5 levels and relatively moderate levelling action, considerably more of the signal amplitude at 
high frequencies is of natural origin and much less due to artificial artefacts, so not using the 
low pass filter (or setting the cut-off frequency at a relatively high frequency is then 
preferable. By selectively activating the high pass filter dependent on the amplification, an 
improved sound reproduction is possible. Instead of activating/deactivating the low pass filter 
10 (i.e. an on/off choice) the filter characteristic of the low pass filter may be made amplification 
dependent, for instance having a filter cutoff frequency of 2 kHz for high amplification (loud 
environmental noise) and Fs/2, or e.g. 4 kHz, for low amplification or no environmental 
noise, or in other words wherein the cut-off frequency decreases as the loudness level 
increases. 

1 5 Alternatively the device may have a means for manually setting the filter 

characteristics of the low pass filter, e.g. a knob enabling the user to set the filter cut off 
frequency to 2, 3 and 4 kHz depending on the (lack of) intelligibility, or personal preference. 
A similar argument holds for the high pass filter, at low amplification levels the need for 
using the high pass filter is less than at high amplification levels, so at low amplification 

20 levels the high pass filter may be deactivated or given a low cut-off frequency level, whereas 
at high amplification level (high loudness) the high-pass filter may be activated or its cut-off 
frequency set at a relatively high value, or in other words wherein the cut-off frequency of 
the high pass filter increases as the loudness level increases. 

In a first embodiment the system comprises the high pass filter followed by an 

25 AGC followed by a limiter/clipper preferably followed by the low pass filter. This 

embodiment is preferred when in circumstances where signal loudness is of prime interest A 
limiter scans for peaks in the audio signal and attenuates the audio portion around the peak if 
the attenuation is necessary to limit the amount of clipping, while yet for very loud signals 
allowing for clipping. 

30 In a second embodiment, the system comprises an automatic volume leveler 

preceded or preferably followed by the high pass filter, providing a levelled signal, followed 
by a gain, and a clipper, preferably followed by the low pass filter. This embodiment is 
preferred when low computational effort is preferred. 
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(Hard) clipping is a simple operation in which any signal above a threshold 
signal strength is reduced to said given threshold signal strength, i.e. a maximum signal 
strength is set The advantage of such an embodiment is that a simple device is used, the 
disadvantage is that the signal is more heavily distorted, since any details in the signal above 
5 the threshold signal are lost 

Within the concept of the invention a 'clipper 1 , 'compressor', 'amplifier', 
'filter', 'converter', 'comparator' etc are to be broadly understood and to comprise e.g. any 
piece of hard-ware (such as a clipper , compressor, amplifier etc), any circuit or sub-circuit 
designed for performing a clipping, compression, amplification etc function as described as 

1 0 well as any piece of soft- ware (computer program or sub program or set of computer 

programs, or program code(s)) designed or programmed to perform a clipping, compressing, 
filtering etc operation in accordance with the invention as well as any combination of pieces 
of hardware and software acting as such, alone or in combination, without being restricted to 
the below given exemplary embodiments. One program may combine several functions. 

15 The invention is also embodied in any computer program comprising program 

code means for performing a method in accordance with the invention when said program is 
ran on a computer as well as in any computer program product comprising program code 
means stored on a computer readable medium for performing a method in accordance with 
the invention when said program is run on a computer, as well as any program product 

20 comprising program code means for use in a telephone system in accordance with the 
invention, for performing the action specific for the invention. 



These and further aspects of the invention will be explained in greater detail 
25 by way of example and with reference to the accompanying drawings, in which 

Fig. 1 is a schematic diagram of a system including a loudspeaker, and a DSP 
Fig. 2 shows schematically a DSP in accordance with the invention 
Fig. 3 shows two examples of high pass filters which may be used in tine 

invention 

30 Fig. 4 shows the waveform of a typical speech signal 

Fig. 5 illustrates the waveform after having been filtered by the high pass filter 
Fig. 6 shows the spectrum of the waveform shown in Figs. 4 and 5 
Fig. 7 illustrates one type of embodiments of the invention 
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Fig. 8 depicts the functional block diagram of an exemplary AGC as 
schematically shown in Fig. 7 

Fig. 9 illustrates a different type of embodiment 

Fig. 10 illustrates in more detail an exemplary AVL element as schematically 
5 indicated in Fig. 9 

Fig. 1 1 illustrates the AVL behavior of the AVL of Fig. 10 

Fig. 12 shows an example of a compression functional relationship. 



10 The present invention will now be described more fully hereinafter with 

reference to the accompanying drawings, in which preferred embodiments of the present 
invention are shown. This invention may, however, be embodied in many different forms and 
should not be construed as limited to the embodiment set forth herein; rather, these 
embodiments are provided so that this disclosure will be thorough and complete, and will 

15 fully convey the scope of the invention to those skilled in the art. Like numbers refer to like 
elements throughout 

Fig. 1 illustrates schematically a sound reproduction system. Such system can 
for instance be a hands-free loudspeaker cellular radiotelephone for use in an automobile. 
When implemented as a hands-free cellular telephone, speech signals received from a far 

20 end, i.e. from a distant party, are transmitted from a cellular base station (not shown), 

received by the transceiver of the cellular phone (not shown), and applied to die input 1 for 
an incoming far end signal as an input waveform W. In this example it is assumed that the 
transmission back and forth between the system, in this example a telephone system, and the 
far end is in a digital form. If the original signals are in an analog form the system comprises 

25 an A-to-D converter to generate a digital far end signal which is then fed into input 1 . 

As shown in Fig. 1, the waveform is applied in a digital format at input 1 of or 
connected to a DSP (digital sound processor) 2, which is connected or which comprises a 
digital output 3. The digital signal output is fed to and converted to an analog format by D-to- 
A converter 4 and amplified by amplifier 5 for use by the loudspeaker 6. A sound pressure 

30 wave Wl representative of the speech of the distant party is emitted by loudspeaker 5. 
Accordingly, the radiotelephone user hears sound pressure waveforms which are 
representative of the speech of the distant party. 
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However, the listener does not just hear the sound generated by the loud 
speaker, but also other sounds, which may make the sound generated by the loud speaker 
difficult to understand, i.e. of low intelligibility. 

Turning up the volume seems a first and obvious choice to increase the 
5 intelligibility. However, the maximum output level of the loud speaker is often limited and 
simply turning up the volume often leads to more noise, not necessarily a better intelligibility 
of the signal. 

To improve the intelligibility a number of co-operating measures are taken in 
the device and method in accordance with the invention. 

10 Fig. 2 illustrates very schematically a DSP (digital sound processor) for use in 

a device in accordance with the invention- The DSP comprises a high pass filter 21 with a 
cut-off frequency between 300 Hz and 2 kHz, preferably between 500 and 1500, more 
preferably between 800 and 1200 Hz. The high pass filter removes or reduces frequency 
components below the cut-off frequency £ 

15 Most of the energy of the signal has thereby been removed. This enables the 

remaining signal to be much more amplified (before running into problems in regards to a 
digital clipping, i.e. a value higher than the maximum value). The amplification is done by 
the compressing amplifier. Within the concept of the invention a compressing amplifier is an 
amplifier which amplifies the signal but also levels the average sound level, i.e. sounds 

20 having a small amplitude are more amplified than sound having a high sound level, thus 
reducing the signal amplitude range. This may be done in several manners e.g. a 
clipper/limiter arrangement, a clipper/compressor or an AVL (automated voltage leveller) 
followed by a gain and a clipper. A number of different techniques may be used, including 
using look-up tables to perform the amplification and compression. The amplitude range, in 

25 particular the upper limit of the range may be set by the manufacturer or influenced by the 
user e.g. by means of a loudness setter (a knob with which the user may set the loudness). 
Various examples will be given below. Compared to a straight forward, linear amplification 
of the signal (i.e. for all sound levels an equal amplification factor) the compressed 
amplification leads to more intelligibility of the sound. The words are more easily 

30 distinguishable, and thus the intelligibility of the sound is improved. It could also lead, 
however, to a distortion of the sound since the non-linear amplification of the sound 
introduced overtones (high frequency components at double, triple etc the original frequency) 
which leads to an increased harshness of the sound. This is perceived by the listener as being 
unpleasant, and in fact, in an important sense, reduces the intelligibility, in a broader sense, of 
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the vocal message, since the harshness of the spoken words often forms an important aspect 
of the vocal message. This effect is present even without clipping, the clipping itself also 
introduces overtones. Intelligibility in a broader sense does not just relate to the words as 
such, but also to the message the speaker wishes to convey to the listener. The harshness of 
5 the sound, especially at higher average amplification, makes everybody sound angry, thus 
strongly reducing the finesses in emotions the speaker wishes to convey. The application of a 
low pass filter, as in a preferred embodiment of the device, depicted in figure 2, after the 
compressing amplifier reduces the perceived harshness of the voice, restoring at least to some 
extent the original emotional content of the spoken words, i.e. giving a much more natural 

10 sound. It is remarked that in most western languages the pitch of a word influences the 
emotional impact of the word, but not the meaning of the word per se. However, there are 
languages in which the pitch of the word plays a much larger role, leading to completely 
different meanings for one and the same 'word" depending on the pitch of the word. When 
such languages are used (which cannot be excluded) the use of the low pass filter becomes 

15 even more advantageous. The invention is in particular of advantage when used in 

conjunction with or in automated speech recognition systems, especially for languages in 
which the pitch of the spoken word influences the meaning of the words. What is above 
discussed in relation to spoken words, i.e. voice, equally applies when the sound reproduction 
system is used to reproduces music. Also in music the way music is perceived is of course 

20 dependent on whether one van hear the notes, but also the harshness of the sound is very 
important The invention is thus, although of great importance to systems in which vocal 
messages are relayed such as telephone systems, not restricted to such systems, systems for 
reproducing music may equally benefit of this invention. 

The levelling or compression action may be performed before HP filtering of 

25 the incoming signal or after it has been HP filtered. The amplification per se (i.e. the gain) is 
done after HP filtering. The clipping is done after the gain or in conjunction. When use is 
made of a low pass filter this is positioned after the clipper. 

Fig. 3 gives two examples of high pass filters usable in a system in accordance 
with the invention. 

30 The left hand side of the figure illustrates a 1 * order filter, the right hand side a 

second order filter. The shown high pass filter have a cut-off frequency f of approximately 1 
kHz. First or second order high-pass filters (which have relatively moderate sloops of 5-15 
dB per octave) are preferred. Removing too much of the low frequencies component results 
in a very unnatural sounding voice (or unnatural odd sounding music) Therefore the order of 
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the high pass filter is preferably limited at 2. This also reduces the computational power 
required. Preferably the user can change the high pass filter from 1 st to 2 nd order and vice 
versa, or the system comprises an automatic switching mechanism dependent on the 
incoming signal. Using the 2 nd order results in high speech intelligibility (in restricted sense, 
i.e. only the words) and/or signal loudness, whereas the 1 * order HP filter will better preserve 
the natural sound of the original signal. 

The HP filter may for instance consist of a biquad whose frequency magnitude 
response is shown in fig. 3 and whose coefficients are listed in 
Table 1, according to the format 

b 0 +b x z~ l +b 2 z~ 2 
l + a t z~ l + a 2 z~ 



Table 1 - Filter coefficients of the HP filters depicted in fig. 3. 





1 st order 


2 nd order 


bo 


0.70710678118655 


0.56903559372885 


bi 


-0.70710678118655 


-1.13807118745770 


b 2 


0.00000000000000 


0.56903559372885 


a\ 


-0.41421356237310 


-0.94280904158206 


02 


0.00000000000000 


0.33333333333333 



15 The effects of application of a high pass filter can be seen in Figs. 4, 5 and 6. 

Fig. 4 shows the waveform of a typical speech signal, figure 5 the waveform 
after having been filtered by the high pass filter (processed by a first order HR high pass 
filter, cutoff frequency at 1 kHz). 

Fig. 6 shows the spectrum of the waveform shown in Figs. 4 and 5, the signal 
20 energy of the waveform of Fig. 4 being depicted in the upper curve, the signal energy of the 
waveform after the high-pass filter being depicted in the lower curve. As can be seen form 
figure 6 the signal energy of the waveform of Fig. 4 is mainly concentrated in the lower 
frequencies (below 1 kHz). 

These frequencies contribute mainly to the specific sound of the voice, but less 
25 to the speech intelligibility. This property forms one aspect of the invention. 

By attenuating the lower frequencies, the signal amplitude will decrease 
significantly, as can be seen in figure 6, creating headroom to amplify the remaining signal 
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which contains relatively more frequencies contributing to the speech intelligibility, as in 
clear from Fig. 6. 

When compressing and amplifying the speech signal afterwards, even when 
clipping it, the speech intelligibility will be better than without the use of the HP filter for 
5 mainly two reasons: 

the signal contains relatively more frequencies contributing to the speech 

intelligibility 

the low frequencies are less hard clipped, resulting in less harmonics (due to 
clipping) disturbing the speech intelligibility 
1 0 Removing too much of the low frequencies, however, would results in a very 

unnatural sounding voice. Therefore, the HP filter is preferably only first order e.g. a 
Butterworth (first order KR) filter. This had the advantage of little computational power 
consumption. 

Fig. 7 illustrates a detail on a system in accordance with an embodiment of the 

15 invention. 

The DSP comprises a HP filter (for instance one as illustrated in figure 3, in 
this example for instance a Butterworth 1 st or 2 nd order filter with cut-off frequency value of 
for instance 1 kHz), followed by an AGC, followed by a limiter/clipper, followed by a low 
pass filter (LP). 

20 In this example all audio streams may be mono. The sample rate frequency 

can be e.g. one of the following: 8 kHz, 1 1 .025 kHz, 16 kHz, 22.05 kHz, 32 kHz, 44.1 kHz or 
48 kHz. 

In this example the AGC (Automatic Gain Control) acts block based, meaning 
that the gain factor only changes per block. In this way the computational power is kept to a 
25 minimum. 

Fig. 8 depicts the functional block diagram of the AGC. The gain is computed 
such that the RMS of the AGC output block is approximately equal to the RMS of the AGC 
input block. Therefore, the relation of the input RMS over the output RMS (this is the target 
value) is compared to the actual gain, and this gain is adjusted towards the target value. The 
30 gain variations must be smoothed to avoid too much signal distortion due to local waveform 
discontinuities at the block edges. 

The limiter/clipper increases the loudness level while keeping signal distortion 
within bounds, preferably as low as possible. 
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In this example the limiter/clipper is a sample-based gain. In common 
compression techniques the gain is dependant on the available signal headroom. 

The limiter/clipper may use a so-called crest-factor to calculate the allowed 
gain. The crest-factor is in fact the relation of the peak value and the running RMS value of 
5 the signals. In the limiter/clipper, signals with a high crest-factor, e.g. a drum beats, are 
clipped without severe audible distortion. Doing this, the gain can remain at a higher level, 
reducing the jumping effect? 9 of common compressors. 

The gain is calculated as follows in this exemplary embodiment: 
First of all, the running RMS (Root Mean Square) value of the input signal is 
10 computed. This RMS value is a smoothed average based on the recent "history" of the signal 
waveform. Then the peak value is calculated using a look-ahead time in order to anticipate to 
upcoming signal peaks. 

With the RMS and the peak value, the crest-factor is computed. A so-called 
"depeak" factor is used to specify how hard the algorithm can clip peaky signals (high values 
1 5 will yield more clipping). Afterwards the gain is computed and is compared with the 

maximal allowed gain, which can be set by the user, and the minimal value of the two is 
chosen. Although not shown here the maximum allowed gain setting can be an input for the 
high pass filter, wherein the cut-off frequency is a function (or more in general one or more 
characteristics of the filter, which could apart from the value of the cut-off frequency also or 
20 alternatively e.g. include switching from a first order to a second order) of the m a ximum 
allowed gain setting. 

The low pass filter LP filters the high frequencies with a more natural sound as 
a consequence. Although not shown here the maximum allowed gain setting can be an input 
for the high pass filter, wherein the cut-off frequency is a function (or more in general one or 
25 more characteristics of the filter, which could apart from the value of the cut-off frequency 
also or alternatively e.g. include switching from a first order to a second order) of the 
maximum allowed gain setting. 

Figs. 7 and 8 illustrate one type of embodiments of the invention, fig. 9 and 10 
are directed to a different type of embodiments. 
30 Basically these embodiment comprise 4, preferably 5 elements or steps: 

1 . an AVL (Automatic Volume Leveller) : the AVL is a signal dependant 

processing block, keeping the volume of the incoming signal at an approximately constant 
level, 
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2. a first order HP (high pass) filter : this filter removes a part of the lower 
frequencies, creating headroom for amplification, 

3. a gain : increasing the SPL (Sound Pressure Level) of the signal, 

4. a clipper, preferably a hard clipper when a simple device is preferred: the 

5 signal is clipped at a certain amplitude, to assure linear operation of the analogue amplifier 
(after D/A conversion). Instead of a hard clipper, which simply clips the signal above the 
clipping level, a soft clipper may also be used, which clips the signal above a clipping level 
but also attenuates the signal at level close to the clipping level. Using a soft-clipper restores 
to some extent the dynamic behavior of the signal, increasing intelligibility, 
10 5. (preferred) a first order LP (low pass) filter : the filter restores or at least 

improves the balance between mid-range frequencies and high frequencies is unnatural, and 
the signal sounds rather harsh; this filter makes the processed sound more pleasant to listen 
to. 

In this example the input is a speech input, but it is remarked that the input 
1 5 may be any sound signal 

Fig. 1 0 illustrates in more detail the AVL element. 

Fig. 10 shows the block diagram of the AVL. Division of the maximum output 
amplitude (called <€ Reference_CTppmgJLin" in Fig. 10) by the absolute value of the 
itifreimnm amplitude of the signal across the entire input block yields the gain that can be 

20 applied to the block without any clipping (called 'Instant gain"). The parameter 

"GainUpdate" is typically 1/1 000 th . Thus, the signal after the sum is increased each block 
with an amount depending on the 'Instant gain 9 '. Doing so, the AVL gain will be increased 
fester for higher values of "Instant gain'*. 

Since the eventual gain applied to the input signal ("AVL gain") should not be 

25 higher than ("Instant gain"), the minimum of both is taken ("Mm"). The minimum block 

also assures that the AVL gain is decreased immediately when the incoming signal amplitude 
increases. To prevent that silent parts in the input signal are amplified too much, the signal 
after the minimum block is limited (typically 12 dB). Note that, since the maximum output 
amplitude is not necessarily equal to the linear level representing 0 dB, the AVL gain can be 

30 smaller than 1. 

Last, if the amplitude of the input signal is smaller than a certain value, for 
example the signal noise floor, no gain will be applied. This is a secondary functionality, in 
preferred embodiments, which can also be used as a functionality in other types of 
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compressing amplifiers to prevent small input signals from being amplified, noise is men not 
amplified, which increases the intelligibilily of the sound. 
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Numeric example: 
sample rate is 8 kHz 
block length is 80 samples 
5 - word length is 16 bit 

maximum output amplitude is 32768 (0 dB) 

input signal is sinusoidal with amplitude 32768 (0 dB), frequency >= 100 Hz 
(at least one sine period per block) 

current AVL gain is 1 (0 dB) 
10 - GainUpdate = 0.001 

AVL gain ("Old value") is limited at 12 dB 

Now, the input signal amplitude decreases by 40 dB (factor 100 on a linear 
scale). The table below shows how the different signals inside the AVL change with time. 
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The AVL behavior is also shown in Fig. 11 . 

The left graph shows the step change of the amplitude of three input signals. 
The right graph shows that the AVL gain increases fester for large changes in amplitude. This 
20 is a preferred embodiment further improving intelligibility. 

Fig. 12 shows an example of a compression function. Upto a certain value the 
signal is left untreated (multiplied by one) and above that value the outputted signal 
amplitude is smaller than the inputted signal amplitude, i.e. the signal is compressed. A 
functional form which only applies compression in the highest regions as in Fig. 12 may be 
25 used for the clipper, whereas a functional form which provides a compression strategy over a 
larger region (e.g. the whole input signal range) may be used for the compressing amplifier. 
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The invention may be used in various devices. The invention is in particular 
useful for hands free mobile phones. However, it is applicable for all sound reproduction 
systems, especially those which run on a device with a limited voltage supply and/or small 
loudspeaker. A list of possible applications: 
5 - handsets (mobile phone, DECT, etc.); 

portable devices, e.g. a portable DVD player; 

PDA'S; 

car-kits; 

TVs; computers; 
10 - web-terminals; 

answering machines. 

It will be appreciated by persons drilled in the art that the present invention is 
not limited by what has been particularly shown and described hereinabove. The invention 
resides in each and every novel characteristic feature and each and every combination of 
15 characteristic features. Reference numerals in the claims do not limit their protective scope. 
Use of the verb "to comprise" and its conjugations does not exclude the presence of elements 
other than those stated in the claims. Use of the article "a" or "an" preceding an element does 
not exclude the presence of a plurality of such elements. 

The present invention has been described in terms of specific embodiments, 
20 which are illustrative of the invention and not to be construed as limiting. The invention may 
be implemented in hardware, firmware or software, or in a combination of them. Other 
embodiments are within the scope of the following claims. 

In short the invention can be described as follows: 

A sound reproduction system comprises a digital audio signal input (1), a 
25 digital audio signal processor (2, DSP) and a digital audio signal output (1) wherein the 

digital signal processor (2, DSP) comprises a high pass filter (21) with a pass frequency (f) of 
between 300 Hz and 2 kHz, a compression amplifier (22) for compression and amplification 
of a signal to within a signal amplitude range and a clipper for clipping the signal above a 
clipping level, and preferably a low pass filter (23) for filtering the signal provided by the 
30 compressing amplifier and for providing an output signal, the pass frequency of the low pass 
filter (f ) lying in the range 2 kHz-Fs/2 where Fs is the sampling frequency. 

Fs/2 may be e.g. 4 kHz. 

Many variations are possible within the concept of the invention. 
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CLAIMS: 



1. A sound reproduction system comprising a digital audio signal input (1), a 
digital audio signal processor (2, DSP) and a digital audio signal output (1) wherein the 
digital signal processor (2, DSP) comprises a high pass (HP) filter (21) with a pass frequency 
(f) of between 300 Hz and 2 kHz, a compressing amplifier (22) for compression and 
amplification of a signal, at least amplification being performed after HP filtering, and a 
clipper for clipping the HP filtered, compressed and amplified signal above a clipping level. 

2. A sound reproduction system as claimed in claim 1 , wherein the compressing 
amplifier is arranged not to amplify a signal having a signal strength below a threshold value. 

3. A sound reproduction system as claimed in claim 1, wherein the digital audio 
processor comprises a low pass filter (23) for filtering the signal provided by the compressing 
amplifier and for providing an output signal, the pass frequency of the low pass filter (f) 
lying in the range 2 kHz-Fs/2 where Fs is the sampling frequency. 

4. A sound reproduction system as claimed in claim 1, wherein the high pass 
filter is a first order or second order filter. 

5. A sound reproduction system as claimed in claim 1 , wherein the system 
comprises a means for activation and/or setting of the frequency dependence of the low pass 
filter in dependence on the amplification in the compressing amplifier. 

6. A sound reproduction system as claimed in claim 1 , wherein the system 
comprises the high pass filter followed by an AGC followed by a limiter/clipper. 

7. A sound reproduction system as claimed in claim 1 , wherein the system 
comprises an automatic volume leveller preceded, or preferably, followed by the high pass 
filter, providing a levelled signal, followed by a gain and a clipper. 
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8. A method for processing digital sound signals in which method frequency 
component of the sound signal lower than a cut-off frequency f between 300 Hz and 2 kHz 
are attenuated, the sound signals axe amplified and compressed to within a signal hand width 
and clipped above a clipping level within the signal band width. 

5 

9. A method as claimed in claim 8, wherein after compression and clipping 
frequency components of the resulting digital signal below a cut off frequency f between 2 
and 4 kHz are attenuated. 

10 10. A method as claimed in claim 8, wherein activation and/or setting of the 

frequency dependence of the low pass filter is performed in dependence on the amplification 
in the levelling amplification step. 

1 1 . Computer program comprising program code means for performing a method 
15 as claimed in any one of claims 8 to 10 when said program is run on a computer. 

12. Computer program product comprising program code means stored on a 
computer readable medium for performing a method as claimed in any one of claims 8 to 10. 
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ABSTRACT: 



A sound reproduction system comprises a digital audio signal input (1), a 
digital audio signal processor (2, DSP) and a digital audio signal output (1) wherein the 
digital signal processor (2, DSP) comprises a high pass filter (21) with a pass frequency (f) of 
between 300 Hz and 2 kHz, a compression amplifier (22) for compression and amplification 
5 of a signal to within a signal amplitude range and a clipper for clipping the signal above a 
clipping level, and preferably a low pass filter (23) for filtering the signal provided by the 
compressing amplifier and for providing an output signal, the pass frequency of the low pass 
filter (f ) lying in the range 2 kHz-Fs/2 where Fs is the sampling frequency. 

Fs/2 may be e.g. 4 kHz. 

10 

Fig. 2 
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